# Multi-Precision Quantization
## DeepSeek R1 Llama 8B F32 GGUF
License: Apache-2.0 · Author: prithivMLmods · Downloads: 326 · Likes: 1 · Tags: Large Language Model, Transformers, English

DeepSeek-R1-Llama-8B-F32-GGUF is a quantized version of DeepSeek-R1-Distill-Llama-8B, a model trained directly with reinforcement learning that features self-verification, reflection, and extended chain-of-thought reasoning.
## Microsoft Phi 4 Reasoning GGUF
License: MIT · Author: bartowski · Downloads: 5,443 · Likes: 4 · Tags: Large Language Model

A quantized version of Microsoft's Phi-4-reasoning model, converted with llama.cpp for inference and offered in multiple quantization options.
## Google Gemma 3 12b It Qat GGUF
Author: bartowski · Downloads: 10.78k · Likes: 16 · Tags: Large Language Model

A Gemma-3-12b model quantized from Google's QAT (Quantization-Aware Training) weights, offered in multiple quantized versions to accommodate different hardware requirements.
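The per-block scaling idea behind GGUF-style quantization (whether applied post-training or, as with the QAT weights above, anticipated during training) can be illustrated with a minimal sketch. This is an assumed simplification for illustration only, not the actual GGUF Q4/Q8 bit layouts:

```python
import numpy as np

def quantize_blocks(weights, block_size=32):
    """Symmetric int8 quantization with one scale per block of weights."""
    blocks = weights.reshape(-1, block_size)
    # One scale per block, chosen so the largest magnitude maps to 127.
    scales = np.abs(blocks).max(axis=1, keepdims=True) / 127.0
    scales[scales == 0] = 1.0  # avoid division by zero for all-zero blocks
    q = np.clip(np.round(blocks / scales), -127, 127).astype(np.int8)
    return q, scales

def dequantize_blocks(q, scales):
    """Recover approximate float weights from int8 values and block scales."""
    return (q.astype(np.float32) * scales).reshape(-1)

w = np.random.randn(4096).astype(np.float32)
q, scales = quantize_blocks(w)
w_hat = dequantize_blocks(q, scales)
print(np.abs(w - w_hat).max())  # per-weight error is at most half a scale step
```

Smaller blocks track local weight magnitudes more closely (lower error) at the cost of storing more scales, which is the trade-off the different quantization levels in these repositories navigate.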
## FLUX.1 Redux Dev GGUF
License: Other · Author: second-state · Downloads: 527 · Likes: 9 · Tags: Text-to-Image, English

FLUX.1-Redux-dev is a text-to-image generation model built on the FLUX technology stack; it supports English prompts and is distributed under a non-commercial license.
## Bge Base En V1.5 Gguf
License: MIT · Author: CompendiumLabs · Downloads: 1,108 · Likes: 5 · Tags: Text Embedding

The BGE embedding model stored in GGUF format for use with llama.cpp, offering better inference performance than running the model through transformers.